Outline of MM-GBSA scoring project for protein design (12-26)


[These notes are also copied on the academic wiki since the other collaborative projects are listed there. We have included it here due to its connection to the MM-GBSA sirtuin project.]


RC: Karthik, the NSF sirtuin proposal was oriented towards protein design, but there is no protein design in the projects we have planned. See below for such a project. This can take the place of the project originally proposed using the python protein design code, which we considered a little too complicated for the students. If you agree on offering this project, please start making the slides based on the points I have made below (a couple of slides on background, a couple on approach). The Boas and Harbury paper mentioned below should be in the dropbox - Brad was using it.


MM-GBSA rescoring of the sequence libraries generated in PNAS papers. No resampling of sequences to start - use those already generated in PNAS. RC will provide the files.


Background and purpose: While glidescore performs well at rank-ordering sequences for a given ligand, it does not provide accurate binding free energies. Evidence in literature (e.g. Boas in protein design, others EK has found in drug design) suggests MM-GBSA can also rank order sequences or ligand poses. (MM-GBSA is most useful in drug design for rank ordering cogeneric series of ligands for the same receptor – a problem where ensemble averages for the unbound ligand are important – but in protein design the ligand remains fixed and the receptor changes). There is also evidence that MM-GBSA can predict binding free energies with appropriate experimental calibration, unlike Glidescore. (Here, experimental calibration would require a catalytically inactive substrate analog, and we will not do experiments at this point. “Cogeneric series” used for calibration is a series of sequences. Experiments could later be done on sirtuins.) Accurate calculation of sequence entropies and evolutionary temperatures (assuming Km \approx Kd) -- see the slide on sequence entropies in above slide set -- requires accurate prediction of relative binding free energies. Hence we will rescore the sequences obtained in the PNAS papers using MM-GBSA, with the aim of ultimately being able to computationally predict the changes in binding free energy of the substrate due to mutations, analogously to the manner in which MM-GBSA is being used by EK to predict binding affinities of cogeneric inhibitors.


Approach: Our prior work suggests MM-GBSA may not be as effective as glidescore in rank ordering sequences. Students would read papers on use of MM-GBSA for pose sampling (EK has noted that there are papers in the literature that use MM-GBSA for pose sampling - please try to find them), as well as protein design (Boas) -- these papers suggest that MM-GBSA --is-- effective at rank-ordering, and write a script that would do energy minimization and scoring (using both MM-GBSA and Glidescore) of sequence lists provided by us for several enzymes including beta-lactamase. For MM-GBSA, a single ensemble calculation is needed for each ligand. If MM-GBSA does not rank order the sequences like glidescore, students analyze the breakdown of contributions to glidescore and MM-GBSA scores, scale vdW radii and other free parameters, and explain why the rank orderings differ.


Outcomes: Could result in paper jointly between PMC-AT and CMU, to be followed with a paper applying the python protein design code for sequence sampling. Any code written for ensemble MM-GBSA calculations could be integrated with the python protein design code.

Could connect KM's work for thesis to protein design.

Would give us more time to get cloud resources and associated codes for sampling up and running.



Slides for both Anna talk could be largely borrowed from previous talks, with KM making a few more for the proposal above, while refining his GBSA slides.

Esp useful if MD slides not fully developed, or if issues with MD accuracy.




RC Comments on 9-10 sirtuin draft

I have posted here my comments and plan for our future revisions (in which I will also be engaged) for both computation and experiment. Let us start with the * * points, then the * points, and then EK and I will work to address other computational revisions/additions while XG will revise experimental sections.

Question for discussion (please comment): Due to length should we consider splitting paper into 2? Do not want this to compromise quality of any one paper or to delay submission. Would allow us to include more of our results on computation. Biochemistry is not a computation-heavy journal. Possible breakup:

1) fluorescence assay / docking (no binding affinity estimation or iso-NAM). Possible journal: Biochemistry.
2) continuous assay/MM-GBSA binding affinity calculations, and iso-NAM expts and computation. Possible journal: JMB.

We will not do any more work either experimentally or computationally if we choose to break up. We can decide while making the corrections below, most of which are needed either way.

Computational part

These are to be addressed jointly by RC/EK. Priorities for EK are marked with **. Others will be worked on joinrly

1) * *pg 29-34: Clarify specification of which induced fit protocol was used to get these binding affinities. 4/20, 8/10 of docked structures – refer to the same docking protocol?
2) Pg 34: Justification of customized induced fit protocol – better sampling; if energy are lower the sampling is justified. But mention that enhanced sampling could be improved by an iterative algorithm
3) Pg 42: Conclusion – add proposal to prescreen potential activators by MM-GBSA docking followed by experimental screening
4) * *Do we have the MM-GBSA binding affinities for iso-NAM?
5) Any comments on analysis of uncompetitive inhibitor mechanisms using similar techniques?
6) * *Bottom of pg 10: further description in methods section on why docking scoring functions rather than MM-GBSA are used for sampling poses (be aware of length).
7) * *Related to 6), a breakdown of the components of a Glidescore and comparison to the MM-GBSA scoring function.
8) (RC) pg 42: Brief discussion of stationarity issues w MCMM and possible future improvements
9) (RC/EK; later) (pg 42) Description of equations for ensemble calculations to be put in computational methods section
10) pg 29-35: Ensemble calculation results: showing <E> estimate is improved by ensemble sampling but the T\Delta S correction is small/negligible
11) pg 14: Some brief description of the plop backbone sampling method desirable (this is not an exhaustive search)
12) pg 11-15: Order of paragraphs in methods section – alternates between induced fit/customized induced fit, etc
13) pg 10; Some more discussion in methods section on why ensemble calculations on bound states as not as important as / are more challenging than ensemble calculations on the free ligand (methods for sampling joint protein/ligand degrees of freedom, differences in density of states)
14) (RC) pg 23-29: introduce calculation of Kd from Km, kcat experimental results and compare to experimental predictions
15) * *Discuss normalization of MM-GBSA binding affinities. EK to post a paper on this. We will probably just mention that normalization can’t be done here but could be done with a larger dataset on sirtuins/sirtuin inhibitors.

Let us 1st address the * * parts then RC/EK will jointly address the remaining issues and write up together (RC will have some input on how to write up the remaining pts).

Experimental part

1) (pg. 39-40) Any reason to include further details on iso-NAM activation based on “Mechanism of SIRT1 activation” document? There is already some discussion here. Add discussion on base exchange reaction rates only if splitting into two papers
2) Clearer indication of when the fluorescence vs continuous assays are being used – which plots correspond to which assay?
3) Pg 39-40: abrupt transition between inhibition theory and iso-NAM discussion. Also, are we contrasting iso-NAM with allosteric activators like resveratrol here? Should be stated.



(RC): On the latest manuscript draft: a) we should briefly mention how plop optimizes side chains, since the difference with respect to minimization may not be clear to readers. Make sure to refer to rotamer optimization and explain what a rotamer is. b) It is stated that the induced fit protocol only optimizes side chains (not backbones). Is this right? In any case, we used loop optimization in our modified protocol. Do we refer to the loop prediction algorithm in plop anywhere in the paper? c) Excluded volume constraints are mentioned. I don't believe they were used for SIRT3. If they were used for Sir2, we could perhaps be more clear as to precisely where.

Raj



August 23, 2012 (Thu)


(RC) Enumerate the roles of the computational (sampling) tools described (in the wiki 8-10 update and in the latest [[#|draft]]) in future [[#|studies]] aimed at answering questions posed in the tasks in “Sirtuin Kinetics Paper [[#|draft]] 5-16-2012” that we decided not to do in this paper.

(EK) Here are the hypotheses from the “Sirtuin Kinetics Paper [[#|draft]] 5-16-2012”. Most of them have been answered, at least with docking and MM-GBSA binding affinity estimates. The ones that could not be done involve hypothesis 3 because the current methods with docking do not allow testing of induced structural changes beyond the localized environment of the docking area (through Induced Fit or other loop / side chain minimization).

(RC) To what extent have these been integrated into the draft?



(RC) Mention the possibility of induced fit docking of the Sirtris uncompetitive inhibitor of SIRT3; mention that since this is an uncompetitive inhibitor, based on the definition of uncompetitive from XG's section, there must be a conformational change that occurs upon NAD+ binding to the AC pocket.
srt1720.jpg

(EK) Raj, by this, you mean SRT1720, shown here? It is a quinolinecarboxamide compound that acts as a potent, reversible and NAD+ uncompetitive and AceCS2 (Acetyl-CoA synthetase 2) peptide-competitive allosteric inhibitor of SIRT3 activity. In uncompetitive inhibition, the inhibitor (SRT1720) binds exclusively to the ES (protein-NAD+) complex, rather than to the free enzyme form (apo-protein). When the substrate (NAD+) binds, a conformational change occurs in the enzyme which forms or unmasks the inhibition site. The resulting enzyme-substrate-inhibitor complex is catalytically inactive. So the answer to your statement is: Yes, there must be a conformational change that occurs upon NAD+ binding to the AC pocket. It may be possible to search for SRT1720 docking sites with a docking program or with induced fit, using the SIRT3-NAD+ complex. However, since there is no crystal structure of this complex, it is more challenging to create the proper structure using various computational techniques. So far I have docked NAD+ into SIRT3 by minimizing certain sterically clashing side chains and backbone atoms starting from the trapped thio-imidate intermediate. This is the closest structure to the SIRT3-NAD+ complex. But this intermediate may not contain the proper binding site for the SRT1720 inhibitor, because that very inhibitor probably prevents this intermediate imidate from forming.

(RC): Yes the reason I mentioned this is that we are discussing tools (e.g. induced fit) that allow us to determine conformational changes that occur upon NAD+ binding. These same tools are undoubtedly important in studying the mechanism of action of uncompetitive inhibitors, due to the definition above of uncompetitive inhibitors. We now know a little bit about the conformational changes that occur upon NAD+ binding. We should mention this in the paper (while indicating the caveat that larger scale conformational changes may occur that are responsible for the mechanism of action of SRT1720).





Aug. 21, 2012




A note on the AB pocket conformation of NAD+: it can adopt two flipped conformations. Previous descriptions of the NAD+ in the AB pocket crystal structure showed the amide only flipped towards the phosphates. There is another chain in the multimeric crystal structure of 1YC2 which has the amide of the nicotinamide end of NAD+ flipped the other way. This is important because some of the docked structures have the amide flipped out, which we now know is also seen in the crystal structures.
1YC2.AB.flipped.amide.jpg


Docking NAD+ into AB and AC pockets after PRIME refinement: no constraints
Maestro Project Name: redock.after.prime

SIRT3.NAD.ACpocket.jpg
Above Figure: 8 out of top 10 (based on emodel glide score) docked the NAD+ into the AC pocket. Green are these 8 molecules. Red is the NAD+ in the AC pocket from the co-crystallized structure of 1YC2:B. Pink are the 2 structures from 1YC2 (chains A and D) with NAD+ in the AB pocket. The amide from the nicotinamide is pointing in both directions.

SIRT3.NAD.ABpocket.jpg.jpg
Above Figure: 4 out of the top 20 (based on the emodel glide score) docked the NAD+ into the AB pocket. White are these 4 molecules. Similar to the other figure with AC pocket docking from this same simulation, Red is the NAD+ in the AC pocket from the co-crystallized structure of 1YC2:B. Pink are the 2 structures from 1YC2 (chains A and D) with NAD+ in the AB pocket. The rank order of these 4 structures was 11, 13, 17, and 18 with RMSD to the superimposed 1YC2:A NAD+ of 2.18, 1.82, 2.17, 2.48 Å, respectively.


Docking of iso-nicotinamide and nicotinamide into SIRT3
SIRT3.isoNAM.and.NAM.docking.jpg.jpg
Above Figure: Docking of iso-NAM and NAM without constraints into 3GLR, human SIRT3 with acetyl-lysine AceCS2 peptide, but no NAD+ or intermediate co-crystallized. The same PRIME refined structure as used in the above NAD+ docking was used. Glide XP produced only the above shown poses, which were not exactly aligned with the nicotinamide end of the backbone aligned 1YC2:B NAD+. Note that the iso-NAM has the amide end pointing in a different direction than the NAM. It is not clear why the two inhibitors adopt different conformations from the superimposed nicotinamide end of the NAD+. It could be that the side chains (or backbone?) in the C pocket in the crystal structures from SIRT3 are slightly different than that of the C pocket in Sir2 which has NAD+ co-crystallized in the AC pockets. Recall that when NAM was docked into this Sir2 structure, the NAM almost exactly aligned with the nicotinamide end of the NAD+. At a minimum, the iso-NAM and NAM docked in the general C pocket volume, which is an open void allowing much wiggle room for these small molecules. I could do a more careful comparison between the C pockets of Sir2 with NAD+ in the AC pockets and this SIRT3.

A similar docking simulation was done with 3GLT (that has the ADPR thio-intermediate co-crystallized), which resulted in slightly poorer results with respect to RMSD to the reference nicotinamide of the reference NAD+ from 1YC2:B. The top ranked Glide XP iso-NAM and NAM still docked within the region of the C-pocket, but farther away from the reference. The pocket was slightly deformed because of the trapped thio-acetyl ADPR intermediate. For this reason, I chose to use 3GLR, which only has the acetyl-lysine peptide co-crystallized. Unlike for Sir2, no crystal structures with NAD+ bound (not intermediate) are available, so 3GLR was the next best option.




Comment on priorities among remaining tasks listed in the above document (esp iso-NAM docking). XG's draft has a section on iso-NAM. Indicate whether you did iso-NAM docking. Does it bind in the C pocket? See my comments on XG's Mon tasks regarding iso-NAM.

Yes, iso-NAM docking into SIRT3 was done. See Docking Simulations wiki page from Aug. 21. Yes, it binds in the C pocket when using Glide XP.

You asked us to expend our current conclusions to indicate rational design of more effective activators than iso-NAM based on computational studies - e.g., prediction of reaction rates for iso-NAM and NAM dissociation from the C pocket. The possible key to isoNAM's activation of Sir2/SIRT3 in the presence of the more potent nicotinamide inhibitor is isoNAM's ability to antagonize nicotinamide binding to the sirtuin. Nicotinamide inhibition has been shown to be linked not only to binding in the C pocket, but also to reversal of the reaction of the intermediate imidate to the re-formation of the NAD+. IsoNAM is an unreactive isostere, that potentially binds to the C pocket, but cannot participate in the reversal of the reaction because of the shifted location of the nitrogen on the pyridine ring of nicotinamide. This shift disrupts covalent binding to the imidate intermediate, preventing reformation of NAD+.

One path to computationally design these types of inhibitors is to balance increased C-pocket affinity with design of compounds that are non-reactive with the imidate intermediate. A more potent C-pocket binding, but non-reactive species to the imidate, must also be able to leave upon the next reaction cycle - either by movement in the C-pocket due to the deaceylated peptide leaving or the the product 2’-O-acetyl-ADPR leaving. Alternatively, one could imagine a high affinity activator leaving when the next NAD+ that bound to the AB pocket shifts it's nicotinamide end, booting out the activator. While these possible mechanisms are conjecture, this activator must, at some point dissociate from the C pocket so the NAD+ can react with the acetyl-lysine peptide.

You asked us to contrast iso-NAM and [[#|resveratrol]] as Sirtuin activators. It is not clear to me how resveratrol works, so this is difficult to answer.





Aug 16, 2012




Identify flexible loop regions that could be sampled instead of minimized in AB pocket of SIRT3. Sample with PRIME.
SIRT3.loops.jpg
From SIRT3, 3GLT, the above figure shows the important flexible loop regions around the B pocket that are responsible for the steric clashing. The red molecule is the NAD+ in the AB conformation (not co-crystallized, but structurally aligned), while the aqua are the main side chains interfering with the B pocket. The grey backbone trace shows important loop regions that can be sampled. The top loop is GLY319 to GLH325, and the bottom is PHE157 to PRO160. The method involved superimposing the AB structure NAD+ (red molecule) onto the 3GLT crystal structure, then using PRIME to sample around the the NAD+. PRIME sampled the loops and moved the backbone and side chains out of the B pocket. The next step would be to re-dock NAD+ into this PRIME optimized structure. A similar method was used for Sir2.


Outline the steps in the Induced Fit Docking (IFD)

The below outline of IFD is from: Sherman, W., Day, T., Jacobson, M.P., Friesner, R.A., and Farid, R. (2006). Novel Procedure for Modeling Ligand/Receptor Induced Fit Effects. Journal of Medicinal Chemistry 49, 534–553.

The overall procedure (outlined in Figure 1) has four steps:
  1. initial softened-potential docking into a rigid receptor to generate an ensemble of poses;
    1. challenge in 1st step is to generate at least one reasonably docked pose for the ligand (independent of the score it receives). Without a plausible initial guess for the ligand pose, any attempt to predict reorganization of the protein structure is unlikely to succeed in the context of a limited allotment of CPU time.
  2. sampling of the protein for each ligand pose generated in the first step;
    1. challenge in the 2nd step is predicting the low energy receptor conformation for a correct ligand pose, starting from the “plausible” initial guess generated in the first step
  3. reducing of the ligand into low energy induced-fit structures from the previous step;
    1. challenge in the 3rd step is to generate low energy ligand conformations when presented with the correct receptor conformation.
  4. scoring by accounting for the docking energy (GlideScore), and receptor strain and salvation terms (Prime energy).
    1. challenge in 4th scoring step lies in properly ranking the complexes such that the top ranked pose correctly predicts the ligand/ receptor structure.

IFD.protocol.jpg
Figure 1: IFD flowchart. ∆E is the energy gap from the lowest energy structure



Comment on the challenges inherent in identifying activators via docking methods (e.g., one may need to know the rate of dissociation of iso-NAM,NAM)
A part of the docking studies was to explain the activation by iso-NAM in the presence of an inhibitor, and to explain the differences between Sir2 and SIRT3 activation by iso-NAM. One of our hypothesis about how iso-NAM activates Sir2 in the presence of the NAM inhibitor is that iso-NAM and NAM both bind to the C pocket, competitively, but iso-NAM has a higher dissociation rate (it leaves easier). Some of the challenges in using docking to identify activators:







Aug. 9, 2012



Discussion of past problems with SIRT3 AB docking and future direction.

Docking of NAD+ into the AB pocket of SIRT3 is challenging. A number of techniques with traditional docking (Glide) and Induced Fit, including various constrains, have been tried without success. Details of these simulations have been provided previously. This document focuses on causes of post problems and future directions with SIRT3 and the AB pocket. There are twomain approaches in moving forward: (1) additional traditional docking and induced fit, and, (2) advanced molecular dynamics techniques. For these techniques, two significant issues are sufficient ligand pose sampling and sidechain/backbone sampling to alleviate steric clashing.

An alternative argument is that AB docking in SIRT3 it too high in energy or sterically hindered in the real system. This result is partially supported by the experimental data that nicotinamide is a competitive inhibitor of NAD+. The non-competitive mechanism of Sir2 is that NAD+ can dock into the AB pocket while the inhibitor occupies the C pocket. When this inhibitor leaves, the nicotinamide of NAD+ rotates from the B pocket into the C pocket for productive binding. For SIRT3, competitive inhibition implies that NAD+ docking to the AB pocket is unfavorable or impossible when the inhibitor is present in the C pocket. Thus, the simulations failing to dock into the AB pocket are expected. Barring severe steric hinderance making docking impossible, we would like to dock the NAD+ into the AB pocket, even if it is a high energy pose, so that we can get the MM-GBSA estimated binding affinity in the AB pocket to compare to the AC pocket binding.

Before I discuss possible solutions for SIRT3 simulations, here is a summary of the difficulties in previous simulations. The below left picture, previously described, shows residues in the B pocket in SIRT3 (PDB:3GLT; aqua residues) sterically hindering the nicotinamide end of NAD+ (molecule in the center colored by element), compared to residues in the B pocket in Sir2 (Sir2Af2, PDB:1YC2; red residues). This picture shows that at least 3 residues are sterically blocking the NAD+ from docking into the B pocket. A critical issue is whether these clashes can be alleviated via side chain sampling and ligand pose sampling, or does it require larger backbone movement which Induced Fit will not accommodate. There may be backbone movements necessary that are larger than what can be done in Induced Fit. The below right picture shows the possible backbone movements that may be necessary to open up the B pocket. I created this by running a minimization with NAD+ harmonically constrained in the AB pockets (pose from the co-crystallized structure of Sir2 with NAD+ in the AB pocket), and minimized the structure of the protein around the NAD+. The backbone needed to move > 1.7 Å. This test was not comprehensive, because, for example, minimization does not properly sample side chain rearrangements. However, it suggests that backbone movement is a real problem.
SIRT3.Bpocket.obstructions.jpg
Interestingly, when I ran Induced fit again, using this new structure which had an active site minimized to accommodate the harmonically restrained NAD+ in the AB pose, the protocol still failed to re-dock NAD+ into the AB pocket. This result suggests there is a ligand pose sampling problem, whereas previously there may have been both a ligand pose sampling and a backbone/side chain sampling problem. What I suspect is that the scoring function rejects poses in the B pocket due to unfavorable energy. In this case, we know that the NAD+ should dock in the AB pockets, because that is the starting structure that was minimized. However, the scoring function focuses on poses with better scores. I tried to increase the number of output poses to > 1000 (after clustering), thinking that the AB pocket pose would show up, but just at a worse GlideScore. It did not. It is possible to increase the number of output poses even more to 10,000 or 100,000, and rank order these poses based on RMSD to the input AB pocket pose (Induced Fit and Glide can do this rank ordering). Note that this ranking does not affect the type of poses that are output based on the same internal sampling and GlideScore thresholds. It simply sorts the output.

There are probably backbone/sidechain sampling issues, as well, for this energy minimized structure. If Induced Fit comprehensively sampled the backbone and side chains, it should find a receptor structure into which the NAD+ can dock into the AB pocket with a reasonable GlideScore that would appear in at least the 1st 1000 output structures. But because of limited backbone/sidechain sampling, this receptor structure and ligand pose is not found.

Induced Fit's limited constraint control is frustrating. Glide has the following constraints that are not available in Induced Fit: Atom/residue positional constraint, hydrophobic constraints (hydrophobic regions of receptor that are important), excluded volumes, and ligand torsional constraints. We could alter the Induced Fit script to include other Glide constraints like excluded volumes, atom position constraints, ligand torsion constraints. Excluded volumes and atom positional constraints would focus the sidechain/backbone and pose sampling to the AB pocket. Induced Fit is a Python script that iteratively and alternately calls Glide then PRIME. In theory, the rest of the Glide constraints should work, but have just not been coded into the Induced Fit script. Note that adding these constraints may be more complex, as it may complicate the grid generation during each iteration. On the downside, it may take some time for me to hack the code. Schrodinger may be interested in improving their program.

Raj asked how excluded volumes focus pose sampling. It's not that the use of excluded volumes generates more cumulative poses. Rather, it allows the poses that are generated to focus on away from excluded volumes to volumes of interest. Without the excluded volumes, poses in those interest areas may not pass the threshold GlideScore, and would automatically be pruned from the tree before additional pose sampling.

Simulated annealing molecular dynamics is an alternative approach. We've been focusing on using docking programs to estimate binding affinity. Docking is typically done when the ligand pose and/or (for induced fit) active site side chain arrangement is unknown. However, our case involves knowing approximately the pose (in the AB pocket), but not knowing the side chain and backbone rearrangement. A more appropriate simulation would be to run molecular dynamics with simulated annealing (or something similar) to allow the active site to relax around the AB pose. Some flexibility would be allowed in the ligand. After the trajectory has relaxed and stabilized, a collection of low energy conformations could be input into MM-GBSA.

The next [[#|step]] would be to use a more sophisticated molecular dynamics protocol. With MM-GBSA, a single structure of the ligand and receptor is used to estimate binding affinity. The more sophisticated MD protocols use an ensemble of structures from a trajectory, which is theoretically more accurate in predicting binding affinity. More computationally intensive molecular dynamics based simulations are linear interaction energy (LIE), thermodynamic integration (TI) or free energy perturbation (FEP). I'd like to use Desmond and have contact with Huafeng, so the particular method we choose depends on its availability in Desmond. Huafeng said that he did binding affinity calculations with Desmond with FEP.

The other issue with a computationally intensive MD simulation is what computational resources do we have. Could I run a job on many nodes of PMC's or Carnegie Mellon's cluster for a week? Alternatively, we could set up Desmond to run in the cloud using Amazon's EC2. We would use the same platform by Cycle Computing that Schrodinger uses. Huafeng said that Desmond should run on the cloud. Cycle Computing takes care of things like data routing, error handling, and various types of automation necessary for running a scientific computing environment on Amazon’s virtual servers.

See the todo list on Aug. 9, 2012 here for summary points on the next steps.




July 31, 2012




Figures of intermolecular contacts between the Sir2 protein and the nicotinamide of NAD+ in the B and C pockets.


Sir2Af2 NAD AB pocket B pocket.jpg
Above figure: There are NO hydrogen bonds between the nicotinamide end of NAD+ in the B pocket and the protein. The 3 shown residues were the only ones within 4 Å of the nicotinamide. The rest of the protein is to the lower left. The B pocket is open to solution above the nicotinamide. The only depicted H-bonds (dashed lines) are not in the B pocket.




Better Figures for Intermolecular Protein-Ligand Interactions: 2D interaction diagrams

The below diagrams are flattened 2D representations of the protein-ligand interactions. In the diagram, residues are represented as colored spheres, labeled with the residue name and residue number. The colors indicate the residue (or species) type:
Interactions with the protein are marked with lines between ligand atoms and protein residues:
Ligand atoms that are exposed to solvent are marked with gray spheres. The protein “pocket” is displayed with a line, colored with the color of the nearest protein residue. The gap in the line shows the opening of the pocket.


Sir2Af2 (1YC2 chain A) NAD+ in AB pocket
ligand.interaction.diagram.Sir2Af2.1YC2.chainA.AB.jpg
Above figure: Intermolecular protein-ligand diagram of Sir2Af2 (1YC2 chain A) with NAD+ in the AB pocket. Protein and water residues within 3.0 Å of the NAD+ are shown. The lack of a protein "pocket" line around the nicotinamide end and the grey spheres around those atoms indicate that the nicotinamide end is exposed to solvent. The B pocket (show as a yellow circle) is a crevice open to solvent, while the C-pocket (lower yellow circle) is protected by a loop from the solvent (shown in the next diagram). The C-pocket is empty or collapsed in this structure. Also note that there are no H-bonds or other specific intermolecular interactions between the protein and the nicotinamide end of NAD+.




Sir2Af2 (1YC2 chain B) NAD+ in AC pocket
ligand.interaction.diagram.Sir2Af2.1YC2.chainB.AC.jpg
Above figure: Intermolecular protein-ligand diagram of Sir2Af2 (1YC2 chain B) with NAD+ in the AC pocket. Protein and water residues within 3.0 Å of the NAD+ are shown. Unlike in the AB pocket, the NAD+ molecule is completely surrounded by protein residues in the entire A and C pockets. The nicotinamide is not exposed to solvent, unlike in the B pocket. The approximate location of the B pocket is shown due to distortions created by transforming the 3D protein-ligand picture into a simple 2D diagram. There is a backbone H-bond between the protein ILE102 and the nicotinamide end of NAD+.


C pocket for SIRT3 to be added.





July 30, 2012



Sir2 Cross docking:
Some questions have been raised about the results and protocol used for the cross docking studies of Sir2.
The Cross Docking Protocol:
  1. These are tests in which the protein conformation is from the co-crystallized structure with NAD+ originally in the AC pocket. The NAD+ is removed, and then an attempt is made to re-dock NAD+ into the AB pocket.
  2. To prevent the NAD+ from re-docking to the AC pocket instead of the AB, an exclusionary volume is placed in the C pocket
  3. Possibly additional exclusionary volumes surrounding areas neighboring the B pocket are added if the nicotinamide end of NAD+ does not go into the B pocket (as observed from the co-crystallized structure of Sir2 with NAD+ in the AB pocket) with the single exclusion volume in pocket C.

Two sets of cross docking stud

ies were done for
induced.fit.Sir2Af2.jpg








MM-GBSA results for Sir2Af2

Here are the results for in place Glide XP scoring for cross docking of the AB vs. AC conformations of NAD+ from Sir2Af2 (1YC2):
Origin of crystal structures:
Procedure for in place docking of the NAD+ in the AB pose into the protein structure from the AC conformation:
  1. above AB and AC crystal structures were merged.
  2. All protein and other molecules were deleted from the AB structure, except for the NAD+ in the AB conformation and a few key waters
  3. All ligands (NAD+ in AC conformation) and conflicting waters were deleted from the AC structure
  4. Structure was constrained energy minimized with OPLS2005
  5. Glide grid was created
  6. in place scoring with GLIDE XP
  7. MM-GBSA calculation done from within PRIME.

Vice versa was done for in place docking (scoring) of the NAD+ in the AC pose into the protein structure from the AB conformation.
Results (XP Glide Score):

Glide XP
MM-GBSA ∆G bind
AB NAD+ pose into AC protein
-11.425
-92.553
AC NAD+ pose into AB protein
-11.220
-95.161
The Emodel score was 0.00, possibly because no docking was done, only in place scoring. This is a guess.

Discussion
Results likely [[#|agree]] with hypothesis 4 for Sir2.
Hypothesis 4: If NAM is a noncompetitive inhibitor of NAD+, then NAD+ will dock/bind with approximately equal binding affinity to the AB or AC binding pocket. Thus, if NAM occupies pocket C, NAD+ will equally bind to pocket AB.

The two Glide scores for the AB vs. AC binding are very similar, which agrees with the hypothesis. However, the GlideScores are not a reliable indicator for binding affinities or rank ordering the ligands. Thus, we use slightly better measure, the MM-GBSA ∆G_bind, which is calculated as:
dG(1) = E_complex(minimized) - ( E_ligand(minimized) + E_receptor(minimized) )
MM-GBSA is used to estimate relative binding affinity. The absolute values calculated are not necessarily in agreement with experimental binding affinities, thus I am not sure if the units should be kcal/mol. However, the ranking of the ligands based on the calculated binding energies can be expected to agree reasonably well with ranking based on experimental binding affinity, particularly in the case of congeneric series. The two MM-GBSA ∆G_binding of -92.6 and -95.2 are reasonably close to support the original hypothesis 4.







Eric

Comments:

1) The proposal for how to dock to the SIRT3 AB pocket is identical to what I previously saw in an email. If this docking does not work,

the remaining steps do not apply and we will just have those two numbers from MM-GBSA applied to Sir2. As mentioned I will need to see

much more detail in terms of connecting up with the notes from the previous attempts we made, and how things will be done differently in each case this time.

Please connect up as much as possible with the specific details we previously discussed at length.
Also, there are several offline tasks that we discussed that I believe are still pending. I mean the structural superpositions of the various Sir2 and SIRT3 structures with commentary that will allow XG and I to understand which side chains are clashing, which parts of the loop need to be predicted, etc. The latter could be put in the paper irrespective of whether the docking works.

2) Regarding the manuscript, you indicated that you will be placing certain methods descriptions within particular sections of the paper. Why not do some of this now, to whatever extent possible, so we can move along with the manuscript?

3) I agree with the general strategy of motivating MD studies in this paper (and planning out certain aspects of, as you indicated in your wiki task list), but probably not including them in this paper. We will touch base on this again after the pending tasks with docking and induced fit have made more progress.

Raj

Docking Simulations Notes





July 13, 2012

GlideScore vs. MM-GBSA: does the GlideScore contain some implicit solvent calculations?

Short answer: GlideScore does not contain an accurate model for solvation effects to estimate protein-ligand binding affinity, while MM-GBSA estimates solvation effects with a continuum solvent model and a term for solvent accessible surface area.


The GlideScore may contain some solvent effects as describe in the Glide paper (Friesner et al., 2004). The GlideScore does include some explicitly added waters around charged polar groups of the ligand and protein, as well as accounting for possible trapped waters in hydrophobic pockets (which is unfavorable). Glide does this by docking explicit waters into the most favorable poses for the ligand, then scoring these with a "water-score". This description is from the 2004 Glide paper, but it is not well documented in the current Glide manual. I'm not sure if these salvation effects are still in the current 2012 version of GlideScore. Either way, the solvation effects are not similar to how MM-GBSA estimates solvation effects.

MM-GBSA estimates salvation effects differently than the extremely limited solvation estimates in GlideScore (Lyne et al., 2006). MM-GBSA incorporates solvation effects in to the binding energy estimates (ΔEbind):
ΔEbind = ΔEMM + ΔGsolv + ΔGSA Where:

Friesner, R.A., Banks, J.L., Murphy, R.B., Halgren, T.A., Klicic, J.J., Mainz, D.T., Repasky, M.P., Knoll, E.H., Shelley, M., Perry, J.K., et al. (2004). Glide:  A New Approach for Rapid, Accurate Docking and Scoring. 1. Method and Assessment of Docking Accuracy. J. Med. Chem. 47, 1739–1749.

Lyne, P.D., Lamb, M.L., and Saeh, J.C. (2006). Accurate Prediction of the Relative Potencies of Members of a Series of Kinase Inhibitors Using Molecular Docking and MM-GBSA Scoring. J. Med. Chem. 49, 4805–4808.







July 13, 2012
Plan for continued simulations
  1. MM-GBSA calculations for static docking poses of Sir2 from the previous results. Recall that the previous calculated docking energies agreed with the what was expected from experiment. However, publishing the docking energies would not be defendable in a paper, while MM-GBSA energies have at least one leg to stand on. These calculations will not take long.
  2. SIRT3: getting docking to work. How? A number of tweaks or alternative steps could work. I just need to find the correct local energy minimum structure for the AB pocket docking of NAD+. Once we have that, subsequent simulations can start from this structure. A few possibilities:
    • adjust the soft core potential to allow closer contacts so docking can occur in the AB pocket
    • Do a more comprehensive testing of induced fit. Previously I tried a handful of different setups. This time I could try more. Recall how Brad got DOCK to successfully work for some test cases - he systematically adjusted the parameters to minimize the RMSD to the crystal structure. A similar strategy could work here.
    • Do a constrained minimization, where the NAD+ is kept frozen in the AB pocket, and allow the protein to adjust around it. Subsequently use PRIME to optimize the side chains for this new conformation. Compare the backbone to the starting structure and to other Sirtuins co-crystallized in the AB conformation.
  3. After successfully docking NAD+ in the AB pocket of SIRT3, a number of the hypothesis can be tested, as previously described in detail in the Sirtuin Kinetics paper draft I wrote (DropBox:PMC-AT Research / Eric / Sirtuin Kinetics Paper draft 5-16-2012.doc) Many of these involve the difference in the calculated binding energy of NAD+ between the AB and AC pockets, as well as effects that the nicotinamide inhibitor has on those differences. In particular, how will these new simulations connect with the previously planned simulations described in the document?
    1. Hypothesis 1a for SIRT3 can now be done. This is a simulation to test if NAD+ preferentially docks to AC pocket vs. AB pocket. Now, with the docking results from SIRT3 with NAD+ in the AB pocket and the AC pocket, and subsequent MM-GBSA calculations on these docked complexes, the MM-GBSA binding affinity estimate between the AB and AC poses will be compared. The hypothesis here is that the binding affinity is higher for AC than for AB.
    2. Hypothesis 1b is similar to 1a, but for Sir2 instead of SIRT3.
    3. Hypothesis 2 and 3: these test if there are any structural changes induced by the nicotinamide inhibitor upon binding to the C pocket, which would affect the binding affinity of NAD+ to the AB pocket. Docking protocols alone will not answer this question. A full MD method would be best to model protein structure changes upon binding of an inhibitor. An intermediate modeling approach proposed here involves using docking and side chain / loop prediction modeling (PRIME + Glide = Induced Fit). 1st the inhibitor is docked into the C pocket. Then, side chain and loop minimization with Prime relaxes the inhibitor-protein complex. Finally, NAD+ will be re-docked to the AB pocket to find how the inhibitor-protein complex affects binding of the NAD+.
    4. Hypothesis 6 and 7 are about where and how various inhibitors dock. Some work has been previously done on these hypothesis. More work here will be cleaning up and summarizing results, as well as a few extra simulations.
  4. Paper: you asked for a break down of where in the paper the computational sections will go. Results and methods will describe the simulation details, while a section of the discussion will describe the benefits and limitations of the docking simulations with the MM-GBSA. In the area describing further research, a few remarks of future work to address those limitations will discuss more advanced molecular dynamics based techniques.
  5. Docking of a small collection of inhibitors that Xiangying has tested or plans to test. I cannot dock a large library of inhibitors, but I can dock a smaller library of dozens to thousands. There is a list of inhibitors that would be interesting to dock and calculate binding energies with MM-GBSA.
  6. Plan and execute a molecular dynamics simulation to calculate binding energy. No details here yet. Only pursue after complete above.






July 3, 2012

Discussion: How to correlate protein-ligand docking scores or other more advanced simulations techniques to binding affinity?

A presentation is available here:
[[#|DropBox]]:PMC-AT Research/Eric/binding.affinity.predictions.ppt
The notes (not ordered) for the power point presentation are here:
[[#|DropBox]]:PMC-AT Research/Eric/binding.affinity.predictions.doc

July 3, 2012 meeting notes
Present on conference call: Raj, Xiangying, Eric
Title: Binding Affinity Predictions